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(is  this  conclusion  definitely  true,  possibly  true,  or  never  true?). 
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Sternberg,  Note  ll/  to  plausible  alternative  models  that  hare  been  proposed 
in  the  past.  Recent  comparisons  of  the  models  using  a response-selection 
task  have  provided  convincing  evidence  of  the  superiority  of  the  transitive- 
chain  model  for  this  particular  task^Guyote  & Sternberg,  Note  l),  and 
the  present  research  seeks  to  extend-^these  findings  to  the  response- 
evaluation  task.  The  second  goal -of  the  researcBpwas  to  separate  experi- 
mentally the  premise  encoding  and  premise  combination  stages  of  syllogistic 
reasoning,  thereby  enabling  (a)  more  direct  tests  of  the  various  models' 
assumptions  about  each  stage  than  has  been  possible  in  previous  research, 
and  (b)  more  direct  inferences  regarding  the  representations  of  relations 
betveen  the  subject  and  predicate  of  the  premises  as  encoded  and  combined 
by  subjects.  This  second  goal  was  accomplished  by  a modified  form  of 
componential  analysis-!  (Sternberg,  1977 1978 ) ^ whereby  an  information- 
processing task  is  decomposed  into  a series  of  nested  subtasks  that 
permit  isolation  of  the  elementary  components  of  task  performance. 
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Abstract 
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The  present  research  sought  to  understand  the  components  of  syllogistic 
reasoning  used  in  a syllogistic  evaluation  task.  In  this  task,  subjects 
must  indicate  whether  a conclusion  such  as  "Some  Yale  professors  are  hum- 
bugs" is  definitely  true,  possibly  true,  or  never  true  of  a set  of  premises 
such  as  "Some  humbugs  study  syllogistic  reasoning;  some  Yale  professors 
study  syllogistic  reasoning."  A modified  form  of  componential  analysis 
(Sternberg,  1977,  197$)  was  used  to  decompose  the  syllogistic  evaluation 
task  with  abstract  content  into  encoding  and  encoding  plus  combination 
subtasks.  The  response-choice  data  from  these  subtasks  were  used  to  pro- 
vide (a)  more  direct  tests  of  various  models  of  syllogistic  reasoning 
than  has  been  possible  in  the  oast,  and  in  particular,  of  their  assump- 
tions about  sources  of  error  in  syllogistic  reasoning;  and  (b)  more  di- 
rect inferences  regarding  the  representation  of  relations  between  the 
subject  and  predicate  of  the  premises  as  encoded  and  combined.  The 
results  supported  a transitive-chain  model  of  syllogistic  reasoning 
(Guyote  & Sternberg,  Note  1). 
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Components  of  Syllogistic  Reasoning 

Of  the  many  types  of  reasoning  problems  that  have  been  studied  by  both 
psychologists  and  philosophers,  none  has  induced  more  research  than  has  the 
categorical  (or  Aristotelian)  syllogism.  Certainly,  no  other  type  of  reason- 
ing problem  has  retained  so  much  interest  for  so  long.  The  ancient  Greeks 
were  avid  students  of  the  syllogism,  and  the  problem  continues  to  generate 
active  theoretical  controversy  even  in  the  research  of  today. 

A categorical  syllogism  is  a problem  with  two  premises,  the  first  of 
which  is  called  the  major  premise  and  the  second  of  which  is  called  the  minor 
premise.  The  major  premise  describes  a quantified  relation  between  a syllo- 
gistic predicate  and  a middle  term.  The  minor  premise  describes  a quantified 
relation  between  a syllogistic  subject  and  a middle  term.  In  the  syllogismt 
"Some  humbugs  study  syllogistic  reasoning;  some  Yale  professors  study  syllo- 
gistic reasoning,"  for  example,  the  major  premise  describes  a relation  between 
the  predicate  humbugs  and  the  middle  term  syllogistic  reasoning;  the  minor 
premise  describes  a relation  between  the  subject  Yale  professors  and  the  middle 
term  syllogistic  reasoning.  The  subject's  task  is  either  to  (a)  produce  a logically 
valid  conclusion  relating  the  subject,  Yale  professors , to  the  predicate,  humbugs , 
(b)  select  one  of  four  conclusions  as  the  logically  valid  one,  or  (c)  evaluate 
the  validity  of  a given  conclusion.  In  the  evaluation  task,  for  examnle,  the 
subject  might  be  presented  with  the  conclusion  "Some  Yale  professors  are  humbugs," 
The  subject's  task  is  to  decide  whether  the  conclusion  follows  deductively  from 
the  premises,  irrespective  of  its  real-world  trutn  or  falsity.  This  particular 
conclusion  is  deductively  invalid. 

The  research  to  be  described  in  this  article  sought  to  understand  th  com- 
ponents of  syllogistic  reasoning  that  are  used  in  the  evaluation  task.  The-  re- 
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search  had  tvo  major  goals.  The  first  vas  to  compare  one  particular  model  of 
syllogistic  reasoning,  the  transitive-chain  model  (Guyote  & Sternberg,  Note  1), 
to  plausible  alternative  models  that  have  been  proposed  in  the  past.  Recent 
comparisons  of  the  models  using  a response  selection  paradigm  have  provided 
convincing  evidence  of  the  superiority  of  the  transitive- chain  model  for  this 
particular  task  (Guyote  & Sternberg,  Note  1),  and  the  present  research  seeks 
to  extend  these  findings  to  the  response-evaluation  paradigm.  The  second 
goal  of  the  research  was  to  separate  experimentally  the  premise  encoding  and 
premise  combination  stages  of  syllogistic  reasoning,  thereby  enabling  (a) 
more  direct  tests  of  the  various  models'  assumptions  about  each  stage  than 
has  been  possible  in  previous  research,  and  (b)  more  direct  inferences  re- 
garding the  representations  of  relations  between  the  subject  and  predicate 
of  the  premises  as  encoded  and  combined  by  the  subjects.  This  second  goal 
was  accomplished  by  a modified  form  of  componential  analysis  (Ste-nberg,  1977,  1978), 
whereby  an  information-processing  task  is  decomposed  into  a series  of  nested 
subtasks  that  permits  isolation  of  the  elementary  comnonents  of  task  performance. 

Models  of  Syllogistic  Reasoning 
I&ses  for  Distinguishing  Models 

The  experiment  to  be  described  later  permitted  comparative  tests  among 
six  models  of  syllogistic  reasoning:  a baseline  "'ideal- subject”  model,  an 
atmosphere  model  (Woodworth  ’<  Sells,  1935),  a conversion  model  (Chapman  & 

Chapman,  1959),  a random-combination  model  (Erickson,  197*0,  a complete-combi- 
nation model  (Erickson,  197M , and  a transitive-chain  model  (Guyote  !t  Sternberg, 

% 

Noto  1).'  There  arc  two  primary  bases  for  distinguishing  the  rredictions  of  the 
six  models.  First,  the  models  differ  in  whether  they  predict  that  subjects  make 
errors  in  (a)  neither  encoding  nor  combination  of  premises  (ideal  model),  (b) 
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encoding  but  not  combination  of  premises  (conversion  and  complete-combination 
models),  (c)  combination  but  not  encoding  of  premises  (transitive-chain  model), 
v--'  (d)  encoding  and  combination  of  premises  (atmosphere  and  random- combination 
models).  Second,  the  models  differ  in  the  kinds  of  errors  subjects  are  theo- 
rized to  make  in  encoding  and/or  combination.  These  differences  are  what 
give  each  model  its  distinctive  character. 

Verbal  Descriptions  of  Models 

Ideal  model . The  ideal-subject  model  may  be  viewed  as  the  baseline  against  . 
which  the  other  alternative  models  should  be  compared.  It  is  a competence  model 
assuming  that  no  errors  are  made  in  performance.  A performance  model  can  be 
considered  minimally  viable  only  if  its  empirical  predictions  are  superior  to 
those  of  this  model.  In  this  model,  the  subject  is  viewed  as  an  expert  logician. 
The  subject  encodes  all  premises  in  a logically  correct  anl complete  manner.  Then 
the  subject  combines  the  information  that  has  been  encoded  in  a logically  correct 
and  complete  manner.  The  subject's  performance,  then,  is  flawless. 

Atmosphere  model.  Woodworth  and  Sells  (1935)  took  a rather  dim  view  of 
subjects'  logical  abilities,  proposing  a model  that  was  almost  as  extreme  in  its 
postulation  of  alogicality  a3  the  ideal  model  is  extreme  in  its  postulation  of 
complete  logicality.  According  to  the  atmosphere  model,  subjects  always  encode 
the  polarity  (affirmative  or  negative)  arid  quantification  (universal  [all]  or 
particular  [some])  of  each  premise.  In  combining  information  from  the  premises, 
the  presence  of  a negative  in  one  or  both  premises  leads  to  a preference  for 
a negative  conclusion.  The  presence  of  a particular  in  one  or  both  premises  leads 
to  a preference  for  a particular  conclusion.  If  both  a negative  and  a particular 
appear  in  the  premises,  then  the  preferred  conclusion  is  a particular  negative. 
Note  that  in  the  limiting  case  of  this  model,  the  subject  is  utterly  alogical, 
responding  only  on  the  basis  of  surface  features  of  the  premises. 
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Conversion  model.  Whereas  Woodworth  and  Sells  (1935)  suggested  that  sub- 
jects are  largely  alogical  in  their  combination  of  premise  information.  Chapman 
and  Chapman  (1959)  suggested  that  subjects  combine  premise  information  in  a 
logically  correct  and  complete  manner.  Subjects'  errors  derive  from  the  con- 
version of  premises  in  the  encoding  stage  of  syllogistic  reasoning.  In  other 
words,  both  a premise  and  its  converse  are  assumed  to  be  true.  For  example, 
the  premise  "All  A are  B"  is  interpreted  to  mean  that  "All  A are  B and  all  B^ 
are  A."  "Some  A are  B_"  is  interpreted  to  mean  that  "Some  A are  B and  some  B 
are  A."  A problem  arises  because  although  the  meaning  of  the  latter  premise 
is  not  changed  when  this  premise  is  converted,  the  meaning  of  the  former  premise 
is  changed.  Similarly,  the  meaning  of  the  premise  "No  A are  B"  (or,  equiva- 
lently, "All  A are  not  B")  is  not  changed  by  conversion,  whereas  the  meaning 
of  the  premise  "Some  A are  not  B"  is  changed. 

Random-combination  model.  According  to  this  model,  as  proposed  by  Erickson 
(197*0 » subjects  encode  only  certain  single  representations  of  the  set  relations  be- 
tween the  two  terms  of  a premise;  other  representations  are  never  encoded.  The 
five  possible  set  relations  are  shown  in  Figure  1.  Consider,  for  example, 

Insert  Figure  1 about  here 

the  premise  "All  A are  B . " Erickson  has  suggested  that  with  probability  .75, 
subjects  encode  the  relation  between  A and  B_  as  one  of  equivalent  sets;  with 
probability  .25,  subjects  encode  the  relation  as  one  of  subset  (A)  - set  (3). 

Other  set  relations  are  not  used.  Note  that  although  both  relations  arc 
correct,  only  one  is  encoded.  Note  also  that  both  of  the  two  possible  correct 
relations  ore  used  at  least  sometimes.  This  is  not  always  the  case,  however. 

In  representing  the  premise  "Some  A are  13,"  subjects  are  assumed  to  encode 
the  net  (A)  - subset  (3)  relation  with  probability  .25,  and  the  overlapping 
set  relationship  with  probability  .75;  although  relations  of  equivalent  sets 


and  subset  (A)  - set  (3)  are  also  logically  correct,  they  are  never  used.  The 
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probabilities  are  suggested  by  Erickson  as  approximations,  not  final  values. 

^ The  combination  nrocess,  like  the  encoding  process,  is  incomplete.  Sub- 

i Jects  choose  at  random  one  of  the  possible  ways  of  combining  two  representations 

(one  from  each  premise).  For  example,  suppose  that  £ and  £ are  overlapping  sets, 
and  A is  a subset  of  B.  Then  A and  £ might  be  disjoint;  or  they  might  be  over- 
lapping; or  A might  be  a subset  of  £.  The  subject  chooses  one  of  these  relations 
at  random  in  the  combination  process. 

Complete -combination  model.  This  model,  also  proposed  by  Erickson  (197U ) , 
makes  the  same  assumptions  about  encoding  as  does  the  random  combination  model. 
Hie  difference  between  models  lies  in  the  combination  stage.  In  this  stage, 
subjects  are  assumed  to  combine  the  two  representations  (one  from  each  premise) 
in  a logically  correct  and  complete  manner,  that  is,  in  all  possible  valid  ways. 
In  the  example  given  above,  therefore,  the  subject  would  compute  all  three 
possible  relations  between  A and  £. 

Transitive-chain  model.  In  this  model,  proposed  by  Guyote  and  Sternberg 
(Note  1),  subjects  are  assumed  to  encode  premises  in  a logically  correct  and 
complete  manner.  Errors  can  result  from  the  combination  stage,  however.  First, 
subjects  are  assumed  to  combine  a maximum  of  four  pairs  of  representations. 

This  maximum  is  set  by  the  limits  of  working  memory.  Consider,  for  example, 
the  premises  "Some  £ are  £.  All  A are  B. " Complete  encoding  of  the  first 
premise  results  in  four  set  relations:  £ and  £ equivalent,  £ and  £ overlapping, 
£ as  a subset  of  £,  £ as  a superset  of  £.  Complete  encoding  of  the  second 
premise  results  in  two  set  relations:  A and  £ equivalent,  A as  a subset  of  £. 
There  are  eight  possible  ways  to  combine  the  first  four  set  relations  with  the 
second  two  set  relations.  Subjects  use  a maxinum  of  four,  however.  Second, 
identical  representations — A and  £ equivalent — are  combined  before  symmetrical 
representations — A and  £ overlapping,  A and  £ disjoint — which  are  combined 

1 
1 

before  asymmetrical  representations — £ as  a subset  of  £,  A as  a sunerset 

1 

of  £.  In  the  present  context,  a symmetrical  relation  is  defined  as  one  in  which 

k J » - 
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the  positions  of  A and  13  could  he  reversed  without  changing  the  meaning  of  the 
representation.  In  an  asymmetrical  relation,  the  positions  of  A and  B 
cannot  he  changed  without  changing  the  meaning  of  the  relation. 

Quantification  of  Models 

The  random-combination,  complete -combination,  and  transitive- chain 
models  were  specified  by  their  original  authors  in  sufficient  detail  to  per- 
mit quantification.  Quantification  of  the  ideal  model  is  straightforward, 
since  it  always  predicts  the  logically  correct  answer.  The  atmosphere  and 
conversion  models  were  not  originally  specified  in  sufficient  detail  to  per- 
mit quantification,  and  so  we  have  added  what  we  believe  to  be  minor  assump- 
tions that  are  consistent  in  spirit  with  the  original  models.  Quantification 
procedures  for  each  of  the  six  models  are  described  in  the  appendix,  although 
reading  of  this  appendix  should  be  deferred  until  reading  of  the  Method  sec- 
tion is  completed. 

Inevitably,  the  numbers  of  free  parameters  will  vary  across  the  alterna- 
tive models  because  of  their  radically  different  assumptions  about  the  sources 
of  error  in  syllogistic  reasoning.  At  one  extreme,  the  ideal,  atmosphere, 
and  conversion  models  as  formulated  in  this  article  permitted  predictions  of 
responses  without  estimation  of  any  parameters.  At  the  other  extreme,  the 
random- and  complete- combination  models  as  formulated  here  required  as  many  as 
nine  parameters.  The  transitive  chain  model  required  four.  The  differences 
in  numbers  of  parameters  obviously  must  be  taken  into  account  in  assessing 
comparative  model  fits.  As  it  turned  out,  however,  there  was  little  rela- 
tionship between  the  number  of  parameters  in  a model  and  its  fit  to  the  data. 
In  view  of  this  fact,  and  the  fact  that  our  primary  goal  was  to  determine 
which  of  the  models  best  fit  the  data,  regardless  of  the  number  of  sources  of 
error  it  implied,  we  were  not  particularly  concerned  about  differences  in 

- — I I I 


' rV 


Components  of  Reasoning 

8 


/I 


6 


numbers  of  parameters. 

Method 

Subjects 

Subjects  vere  6U  Yale  College  students  participating  for  credit  in  an 
introductory  psychology  course.  Thirty-seven  of  the  subjects  vere  men,  and 
27  vere  vomen.  None  of  the  subjects  had  ever  had  a course  in  formal  logic. 
Materials 

Premises.  Premises  vere  presented  in  the  context  of  either  of  tvo  tasks, 
an  encoding  task  or  a combination  task.  In  the  encoding  task,  subjects  vould 
receive  a premise  such  as  "Some  A are  not  B,"  and  a conclusion  such  as  "No 
A are  B."  The  subject  vould  have  to  indicate  vhether  the  conclusion  vas  def- 
initely, possibly,  or  never  true  of  the  single  premise.  In  this  particular 
case,  the  logically  correct  ansver  is  "possibly  true,"  since  if  at  least  some 
A are  not  B,  it  is  possible  that  in  fact  none  of  the  A are  B.  In  the  combi- 
nation task,  subjects  vould  receive  a pair  of  premises  such  as  "Some  13  are  £. 

All  are  A,"  plus  a conclusion,  such  as  "Some  A are  C. " The  subject  vould 
have  to  indicate  vhether  the  conclusion  vas  definitely,  possibly,  or  never 
true  of  the  premises  considered  together.  In  this  particular  case,  the  logi- 
cally correct  ansver  is  "definitely  true,"  since  for  the  given  premises,  there 
vill  alvays  be  at  least  one  A that  is  a £. 

Premises  in  the  encoding  task  used  the  letters  A and  B.  Premises  in  the 
combination  task  used  the  letters  A,  B,  and  C_  to  denote  the  subjec* , middle 
term,  and  predicate  of  each  syllogism.  All  premises  in  both  tasks  vere  either 
affirmative  or  negative,  and  universal  or  particular,  yielding  four  oossible 
statements:  All  A are  3,  Some  A are  13,  No  A_  are  B,  Some  A are  not  B.  In  the 
encoding  task,  these  four  statements  served  as  the  single  premises  presented 
to  subjects.  In  the  combination  task,  there  vere  15  different  pairs  of  premises. 
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11  of  which  had  at  least  one  valid  conclusion  and  '4  of  which  had  no  valid 
conclusion  among  the  standard  subset  of  four  (described  below).  The  pairs 
of  premises  are  shown  in  Table  5. 

Conclusions.  Each  premise  or  pair  of  premises  was  presented  with  only  a 
single  conclusion  at  a time.  Over  the  course  of  the  experiment,  however,  sub- 
jects received  the  premises  together  with  either  a full  set  of  10  conclusions 
or  a standard  subset  consisting  of  b conclusions  from  the  full  set.  The  sub- 
set consisted  of  the  conclusions  commonly  used  in  syllogistic  reasoning  tasks: 
All  A are  £,  Some  A are  £,  No  A are  £,  Some  A are  not  £.  (in  the  encoding 
task,  B was  substituted  for  £. ) The  full  set  of  conclusions  consisted  of  the 
standard  subset,  plus  six  additional  conclusions:  All  £ are  A,  Some  £ are  not 
A,  All  A are  £ and  all  £ are  A,  Some  but  not  all  A are  £ and  some  but  not  all 
£ are  A,  All  A are  £ and  some  £ are  not  A,  All  £ are  A and  some  A are  not  £. 
(in  the  encoding  task,  3 was  substituted  for  £.  ) The  reason  for  including 
these  particular  additional  conclusions  will  be  discussed  later. 


The  basic  design  of  the  experiment  was  a tvo-by-two  factorial  arrangement , 
with  task  (encoding  or  combination)  crossed  with  number  of  conclusions  (standard 
subset  or  complete  set).  Each  of  the  four  conditions  involved  testing  of  16 
subjects,  none  of  whom  overlapped  between  conditions.  The  dependent  variable 
was  each  subject's  response  of  "definitely  true,"  "possibly  true,"  or  "never 
true"  to  each  conclusion  for  each  premise  or  pair  of  premises. 

Procedure 

Subjects  were  tested  in  small  groups.  At  the  beginning  of  the  testing 
session,  subjects  were  told  the  nature  of  the  task,  and  that  we  were  concerned 
with  logical  validity  of  conclusions.  Subjects  were  warned  that  "possibly  true" 
was  not  intended  as  a weak  form  of,  or  hedge  for,  "definitely  true."  It  was  to 
be  used  only  when  the  conclusion  was  neither  definitely  true  nor  never  true  of 
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the  premise (s).  As  is  customary  in  experiments  on  syllogistic  reasoning,  sub- 
jects were  informed  that  the  logical  meaning  of  some  is  "at  least  some  and 
possibly  all,"  which  contrasts  with  the  everyday  meaning  of  some,  "some  but  not 
all."  After  the  instructions  were  completed,  subjects  tried  one  practice 
problem  with  eacu  type  of  conclusion  they  would  later  receive  in  the  test 
trials.  Thus,  subjects  in  the  subset  groups  received  four  practice  items, 
and  subjects  in  the  full-set  groups  received  ten  practice  items. 

When  they  were  ready  to  begin  the  experimental  task,  subjects  were  told 
that  problems  mu3t  be  solved  in  the  (random)  order  in  which  they  were  presented, 
that  they  were  not  to  refer  back  to  previous  problems  in  solving  new  problems, 
and  that  they  should  respond  as  accurately  as  possible.  The  subjects  then 
began  the  solution  of  test  problems,  which  were  printed  one  to  a page  in  com- 
puter-generated booklets.  Subjects  had  as  much  time  as  they  needed  to  complete 
the  problems,  which  numbered  U0  in  the  encoding  task  with  the  complete  set  of 
conclusions,  16  in  the  encoding  task  with  the  standard  subset  of  conclusions, 

150  in  the  combination  task  with  the  complete  set  of  conclusions,  and  60  in  the 
combination  task  with  the  standard  subset  of  conclusions. 
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Basic  Statistics 

For  the  subset  of  conclusions  common  to  both  the  group  receiving  the  full 
set  of  conclusions  (hereafter,  the  "full-set  group'!)  and  the  group  receiving  the 
subset  of  conclusions  (hereafter,  the  "subset  group"),  the  mean  proportion 
of  logically  correct  responses  in  the  encoding  task  was  .90  for  the  full-set 
group  and  .90  for  the  subset  group.  The  mean  proportion  of  logically  correct 
responses  in  the  combination  task  was  .79  for  the  full-set  group  and  .7**  for 
the  subset  group.  An  analysis  of  variance  revealed  a highly  significant  ef- 
fect of  task,  F(l,60)  = 21.1*1,  jd  <.001,  but  trivial  effects  of  group,  F(l,60)  < 1 
and  task  by  group  interaction,  F(l,60)  < 1. 

These  data  indicate  that  the  combination  task  is  harder  than  the  encoding 
task,  vhich  is  to  be  expected  since  the  combination  task  requires  prior  en- 
coding of  the  premises  of  the  problem.  Performance  on  the  encoding  task  is 
considerably  better  than  would  be  expected  from  the  predictions  of  most  of 
the  performance  models,  but  not  as  good  as  the  perfect  performance  predicted 
by  the  ideal  and  transitive-chain  models.  In  order  to  evaluate  performance 
on  this  task  and  the  combination  task  more  precisely,  it  is  necessary  to  ex- 
amine the  results  of  the  comparative  model  testing. 

Comparative  .*'oiel  Testing 

The  alternative  models  were  quantified  according  to  the  principles  de- 
scribed earlier  and  u3ing  the  procedures  outlined  in  the  appendix.  The  models 
were  tested  through  model  predictions  derived  in  two  ways.  First,  parameter 
estimates  derived  from  the  selection  task  for  abstract  syllogisms  in  Experiment 
1 of  Guyote  and  Sternberg  ("ote  1)  were  entered  into  the  appropriate  equations 
for  the  present  evaluation  task,  and  used  ns  the  basis  for  predicting  responses 
Second,  parameters  were  estimated  from  the  present  data  sets  and  used  n3  the 
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basis  for  predicting  -espouses.  The  first  procedure  enabled  us  to  test  the 
stability  and  general- ty  of  parameter  estimates  over  subjects  (with  two  sam- 
ples drawn  from  the  s Jie  population)  and  over  tasks  (response  selection  in 
Guyote  & Sternberg  [Note  l]  versus  response  evaluation  in  the  present  experi- 
ment; combination  in  Guyote  & Sternberg  [Note  l]  and  the  present  experiment 
versus  encoding  in  the  present  experiment  cnly).  Since  the  number  of  data 
points  was  also  greater  in  the  Guyote  and  Sternberg  experiment  than  in  the 
present  experiment  (l80  versus  a range  of  1 6 to  150 — see  Tables  1 and  2), 
the  parameter  estimates  from  the  selection  task  could  be  expected  to  be 
somewhat  more  constrained  than  those  from  the  present  evaluation  task.  The 
second  procedure  enabled  us  to  test  how  well  the  models  fit  the  data  when 
parameters  were  estimated  specifically  for  the  tasks  and  subjects  at  hand. 

Encoding.  Table  1 shows  comparative  model  fits  for  the  encoding  task. 

Insert  Table  1 about  here 

Model  fits  are  presented  for  three  data  sets.  The  first  data  set  is  for  the 
standard  subset  of  conclusions  (All  A are  £;  No  A are  C;  Some  A are  £;  Some 
A are  not  £)  as  responded  to  by  the  subset  group.  The  second  data  set  is  for 

I 

the  same  subset  of  conclusions  as  responded  to  by  the  full-set  group.  The 
third  data  set  is  for  the  full  set  of  conclusions  as  responded  to  by  the  full- 
set  group.  Model  fits  are  presented  in  terns  of  the  proportion  of  variance 
in  the  data  accounted  for  by  each  model  (R  ) and  the  root-mean-square  deviation 
of  the  observed  from  the  predicted  values  ( RMSD ) . Each  data  point  was  a mean 
over  subjects  of  the  proposed  response  (1=  definitely  true,  2=possibly  true, 

3=never  true)  of  the  subjects  to  each  conclusion  for  each  of  the  four  premises. 
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The  reliability  of  each  data  set  was  computed  by  arbitrarily  dividing 
the  subjects  into  two  groups  of  equal  size,  computing  the  correlation  across 
item  type3  between  quantified  responses  for  the  two  sets  of  subjects,  and 
correcting  this  correlation  by  the  Spearman-Brown  formula  to  take  into  ac- 
count the  fact  that  only  half  the  subjects  were  used  for  computing  the  values 
of  each  variate. 

With  the  Guyote-Sternberg  parameter  estimates,  the  ideal-subject  and 
transitive-chain  models  do  an  excellent  Job  of  predicting  subjects'  responses 
for  all  three  data  sets.  The  complete-  and  random-combination  models  and  the 
atmosphere  model  clearly  do  not  predict  the  data  as  well,  despite  the  large 
number  of  parameters  estimated  for  the  first  two  models.  The  conversion  model 
makes  predictions  indistinguishable  from  those  of  the  ideal-subject  and  transi- 
tive-chain models  for  the  subset  of  conclusions,  but  its  predictions  are  dis- 
tinguished for  the  full  3et  of  conclusions;  and  for  this  set,  the  conversion 
model  is  inferior  to  the  ideal-subject  and  transitive-chain  models.  These 
data  are  therefore  consistent  with  the  notion  that  subjects  approximate  a 
strategy  of  complete  and  correct  encoding  of  the  syllogistic  premises. 

The  parameters  as  estimated  directly  from  the  present  data  are  less 
useful  in  distinguishing  among  models.  Possible  reasons  for  this  lessened 
distinguishability  are  discussed  in  the  next  section.  Whatever  the  reasons, 
it  appears  that  the  larger  numbers  of  parameters  in  the  complete-  and  random- 
combination  models  buy  virtually  nothing  in  the  present  analysis  in  terms  of 
improved  prediction  of  the  complete-  and  random-combination  models  over  the 
ideal-subject  and  tran3itive-chain  models. 

It  is  important  to  note  that  not  even  these  two  preferred  (and  indistinguish- 


able) models  for  the  encoding  task  perform  at  a level  comparable  to  the  reliability 
of  the  data.  Thu3,  the  Ideal-subject/tran3itive-chain  strategy  of  complet 
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and  correct  encoding  in  be  viewed  only  as  an  approximation  to  the 
strategy  or  strategic*  subjects  actually  use.  A true  model  of  performance 
in  the  encoding  task  would  have  to  account  for  all  of  the  systematic  variance 
in  the  data,  something  none  of  the  models  tested  is  able  to  do. 

Is  nearly  flawless  encoding  characteristic  of  individual  as  well  as 
group  performance,  or  might  it  be  some  averaging  artifact  that  does  not  in 
fact  veridically  represent  the  performance  of  individual  subjects?  The 
answer  to  this  question  was  sought  through  the  modeling  of  individual  data. 

Table  2 shows  individual  model  fits  for  members  of  the  subset  group.  Fits 
for  each  model  are  presented  both  in  terms  of  R and  RMSD.  Because  the 
models  predict  probabilities  of  responses,  parameter  estimation  for  indi- 
vidual subjects  would  be  possible  only  with  very  large  numbers  of  replications. 
Since  there  were  no  replications  for  individual  subjects  in  this  experiment, 
the  group  parameter  estimates  (from  Experiment  1 of  Ouyote  ?«  Sternberg,  TIote  1) 
were  used  in  the  model  fitting.  The  absence  of  replications  also  made  it 
impossible  to  compute  reliability  of  the  individual  data  in  the  standard  fashion. 
The  reliability  index  (or  square  root  of  the  reliability  coefficient)  was  there- 
fore estimated  as  the  correlation  between  an  individual's  pattern  of  data  and 
the  group  pattern  of  data  (with  that  individual  excluded).  This  estimate  of 
the  reliability  index  is  conservative,  since  it  assumes  that  the  group  pattern 
represents  the  true  pattern  (over  infinite  replications)  for  each  individual. 


Insert  Table  2 about  here 


The  first  thing  to  note  in  the  table  is  that  despite  the  absence  of  repli- 
cated observations  for  individual  subjects,  the  data  for  these  subjects  were 
highly  reliable.  The  estimated  reliability  index  was  at  least  .85  in  15  of  l6  cases, 
at  least  .9Ciin  11  of  16  cases.  Subject  10,  with  an  index  of  .58,  was  an  anomaly. 
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The  second  thing  to  note  is  that  a model  of  flawless  encoding  best 

2 

accounted  for  the  data  of  almost  all  of  the  subjects  (13  of  l6  using  R 
as  the  criterion,  or  1**  of  1 6 using  RMSD  as  the  criterion).  This  model 
accounted  for  all  of  the  variance  in  the  data  of  6 of  l6  subjects,  and 
the  mean  R across  subjects  was  .84,  a most  respectable  level  of  fit  for 
individual  data.  This  mean  is  reduced  considerably  by  the  anomalous 
Subject  10,  whose  data  were  of  low  reliability  and  for  whom  none  of  the 
models  were  satisfactory.  (Note  that  for  the  subset  of  conclusions,  the 
predictions  of  the  conversion  model  cannot  be  distinguished  from  those 
of  the  ideal  and  transitive-chain  models.  The  data  for  the  full  set  of 
conclusions  does  distinguish  models,  and  the  individual  data  for  the  full 
set  of  conclusions,  like  the  group  data,  argue  strongly  against  the  con- 
version model.) 

2 

In  general,  the  patterns  of  RMSD  closely  follow  those  of  R . This  con- 
gruence of  outcome  is  to  be  expected  where  subjects  use  the  scale  of  response 

2 

choices  in  much  the  same  way,  since  the  major  difference  between  R and  RMSD 
2 

is  that  R allows  an  additive  constant  in  evaluating  fit,  whereas  RMSD  does  not. 
Individual  differences  in  the  use  of  the  response  scale  (l=definitely  true, 
2=possibly  true,  3=never  true)  appear  to  have  been  quite  small.  Mean  scale 
values  averaged  across  items  ranged  from  1.94  to  2.31  for  individual  subjects, 
with  a grand  mean  of  2.04  and  a standard  deviation  of  .03.  The  ideal  subject 
would  have  shown  a mean  of  2.00.  Although  subjects  differed  little  among 
themselves,  and  9 of  l6  subjects  had  means  of  exactly  2.00,  the  mean  across 
subjects  (2.04)  did  differ  significantly  from  the  ideal  mean  (2.00),  = 2.00, 

£ < .05.  Subjects  were  on  the  average  slightly  more  conservative  than  the  ideal 
subject  would  be,  as  would  be  expected  if  they  sometimes  failed  to  apprccia  j 
all  possible  encodings  of  a premise.  Standard  deviations  of  responses  for 
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individual  subjects  anged  from  .73  to  .93,  with  a mean  across  subjects  of 
.87  and  a standard  deviation  of  .05.  This  mean  standard  deviation  was 
significantly  lower  than  that  which  would  be  shown  by  the  ideal  subject  (.89), 
z - -2.15,  £ < .05,  indicating  that  subjects  were  less  variable  in  their 
responses  than  an  ideal  subject  would  have  been. 

Combination.  Table  3 shows  comparative  model  fits  for  the  combination 
task.  Reliability  of  the  data  was  computed  in  the  same  way  as  for  the  encoding 
task.  Model  testing  based  upon  the  Guyote-Sternberg  parameter  estimates 

Insert  Table  3 about  here 

once  again  indicates  the  superiority  of  the  transitive-chain  model  over  the 

2 

alternative  models.  This  model  has  the  highest  value  of  R and  the  lowest 

2 

value  of  RMSD  for  each  of  the  three  data  sets.  The  absolute  levels  of  fit 

axe  also  quite  respectable,  especially  for  the  subset  of  conclusions.  As  in 

2 

the  encoding  task,  however,  the  values  of  R are  lower  than  the  reliability, 
indicating  the  presence  of  systematic  variance  unaccounted  for  by  any  of  the 
models.  None  of  the  models  is  true,  therefore,  in  accounting  for  subjects' 
strategies  in  encoding  and  combining  premise  information. 

As  was  the  case  in  the  analyses  for  the  encoding  task,  the  models  are 
less  distinguishable  when  parameters  are  estimated  for  the  present  data.  Why 
do  parameter  estimates  from  the  evaluation  task  consistently  distinguish  models 
less  well  than  the  estimates  from  the  selection  task?  There  seem  to  be  three 


L 


possible  reasons,  any  or  all  of  which  may  contribute  to  lessened  distinguishabil ity 
in  the  evaluation  task.  First,  the  number  of  data  points  from  which  parameters 
were  estimated  was  larger  in  the  Guyote-Sternberg  experiment  than  in  the  present 
experiment,  so  that  there  was  less  opportunity  for  capitalization  upon  chance 
in  the  earlier  experiment.  In  models  with  relatively  large  numbers  of  narameters 
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(such  as  the  complete-  and  random-combination  models),  capitalization  upon  chance 

can  be  a serious  problem.  Second,  the  response-evaluation  task,  used  in  the 

present  experiment  eliminates  the  comparison  stage  of  syllogistic  reasoning, 

vhich  is  a major  source  of  variation  in  the  differential  predictions  of  the 

alternative  models.  Third,  there  seems  to  be  at  least  some  difficulty  in  the 

combination  task  in  distinguishing  between  relations  that  have  not  been  encoded 

3 

and  those  that  have  been  encoded  but  not  combined.  The  encoding  task,  therefore, 
is  useful  in  showing  the  very  high  level  of  performance  subjects  reach  in 
encoding  when  encoding  is  isolated  from  combination. 

Table  4 shows  individual  model  fits  for  members  of  the  subset  group. 
Procedures  for  computing  the  individual  model  fits  and  reliability  indices  were 
the  same  as  in  the  analysis  of  the  individual  encoding  data. 

Insert  Table  4 about  here 

The  data  are  again  highly  reliable:  The  mean  correlation  between  indi- 
vidual and  group  data  was  as  high  as  .81*.  The  transitive-chain  model  was 
again  the  preferred  model,  best  accounting  for  the  data  of  7 of  l6  subjects 

with  R as  the  criterion,  or  8 of  l6  subjects  with  RMSD  as  the  criterion. 

2 

The  absolute  levels  of  fit,  with  mean  R = .72  and  mean  RMSD  = .41*,  were  also 

quite  respectable  for  individual-subject  data.  In  all  but  two  cases  where 

a model  other  than  the  transitive-chain  model  was  preferred,  the  difference 
2 

in  R and  RMSD  between  the  transitive-chain  model  and  the  preferred  model  was 
small  (between  .01  and  .03  'units  of  RMRD).  In  two  cases,  however  (Subjects 
9 and  11),  the  individual  data  were  clearly  better  fit  by  the  ideal-subject 
model  than  by  any  other  model.  It  thus  appears  that  a small  minority  of  sub- 
jects are  able  not  only  to  encode  premises  almost  perfectly,  but  to  combine 
them  almost  perfectly  as  well.  On  the  whole,  however,  the  individual  data 
argue  in  favor  of  the  general  superiority  of  the  nix’d  model  over  its  competitors. 
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Variation  in  use  of  the  response  scale  was  relatively  snail,  although 
not  as  small  as  in  the  encoding  task.  Mean  responses  for  individual  sub- 
jects ranged  from  1.85  to  2.23,  with  a mean  of  2.02  and  a standard  deviation 
of  .085.  Standard  deviations  (representing  variability  in  responses)  ranged 
from  .70  to  .92,  with  a mean  of  .80  and  a standard  deviation  of  .073. 

Parameter  Estimates 

Compari son  across  subjects  and  tasks . One  basis  for  assessing  the 
adequacy  of  a model  is  the  stability  of  the  model's  parameter  estimates 
across  subjects  and  task  variants.  Stable  parameter  estimates  indicate 
that  the  model  is  generalizable  across  subjects  and  tasks,  and  that  capitali- 
zation upon  chance  is  not  a major  factor  in  determining  fit  of  the  model 
for  any  one  task  or  group  of  subjects.  Unstable  parameter  estimates,  on 
the  other  hand,  call  into  question  the  genei^alizability  of  the  model  and 
the  interpretability  of  any  one  set  of  parameter  estimates. 

Table  5 shows  parameter  estimates  for  the  three  models  for  which  parame- 

Insert  Table  5 about  here 

ters  were  estimated.  Since  encoding  is  assumed  to  be  complete  and  correct 
in  the  transitive- chain  model,  there  are  no  parameters  associated  with  the 
encoding  stage  of  this  model,  and  parameters  were  estimated  for  the  combination 
task  only.  Our  first  concern  was  to  assess  the  stability  of  the  parameter 
estimates  across  subjects  and  tasks.  This  assessment  was  made  by  connuting 
the  root-mean-square  deviation  of  the  estimates  across  subjects  and  tasks. 

These  RMSDs  are  thus  across  pairs  of  columns  for  a given  model  in  Table  5. 

For  the  transitive-chain  model,  RMSD  is  .15  between  the  Ouvote-Oternberg 

and  Sternberg-Turner  data.  RMSDs  were  higher  for  the  alternative  models:  For  the 


complete-combination  model,  the  corresponding  value  of  RM3D  is  .22,  and  for 
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the  random-combination  model,  it  is  .36.  Thus,  the  parameter  estimates  of 
the  transitive-chain  model  do  appear  to  be  more  stable  and  generalizable 
across  subjects  and  tasks,  perhaps  because  of  the  reduction  in  chance  fluc- 
tuations often  associated  vith  estimation  of  a smaller  number  of  parameters. 

The  values  of  RMSD  were  also  computed  between  the  encoding  and  combination 
tasks  in  the  present  experiment.  The  values  of  RMSD  were  .15  for  the  complete- 
combination  model  and  .20  for  the  random-combination  model. 

Interpretation  of  parameter  estimates . The  value  of  p^  indicates  that 
subjects  combine  only  one  pair  of  set  relations  about  half  the  time,  and 
combine  more  than  one  (p^+p^+p^)  the  other  half.  This  restriction  in  the 
number  of  combinations  performed  is  assumed  to  be  due  to  the  limitations 
of  working  memory. 

The  parameters  of  the  complete-  and  randon-combination  models  signify 
probabilities  of  encoding  different  premises  by  means  of  various  set  relations 
(see  Table  5).  For  example,  the  probability  of  encoding  "All  A are  B"  as 
A equivalent  to  B is  estimated  to  be  .80  in  the  present  data  for  the  complete- 
combination  model.  Erickson's  model  as  originally  formulated  contained  only 
what  are  here  called  p^,  p^,  p^>  and  p^.  One  of  these  parameters,  p^,  is  es- 
timated as  0 in  all  but  one  data  set,  where  it  is  estimated  as  .01.  The  aug- 
mentation of  the  models  proposed  here  considerably  improves  the  predictive 
power  of  the  models,  however,  in  that  most  of  parameters  p^-p^  have  values 
that  depart  substantially  from  either  0 or  1.  Apparently,  "Some  A are  B"  and 
"Some  A are  not  B"  do  have  a variety  of  interpretations  assigned  to  them,  if 
one  accepts  either  of  these  models  as  valid  (which  we  do  not). 
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Modified  'Truth  Table  Analysis 

The  global  quantitative  analyses  of  model  fits  and  parameter  estimates 
do  not  enable  one  to  assess  qualitative  features  of  the  data,  such  as  the  nature 
of  the  representations  subjects  use.  We  have  examined  these  qualitative  fea- 
tures of  the  data  through  a modified  truth-table  analysis  similar  in  some 
respects  to  those  employed  by  Staudenmayer  (1975)  and  by  Taplin,  Staudenmayer , 
and  Taddonio  (1971*)  for  conditional  premises.  However,  our  methodology  in 
conducting  this  analysis  was  quite  different  from  anything  that  has  been  at- 
tempted before. 

Consider  as  an  illustration  the  premises  "All  13  are  £.  All  A are  B." 

We  wish  to  know  how  subjects  represent  the  set  relation(s)  that  characterize(s) 

(a)  each  of  the  promises  and  (b)  the  combined  premises.  We  will  deal  in  this 
illustration  only  with  item  (b),  since  the  principles  are  the  same  for  item  (a). 

There  are  five  possible  set  relations  relating  A to  C:  A equivalent  to  £, 

A superset  of  £,  A subset  of  £,  A overlapping  with  £,  disjoint  with  £.  A 
subject's  combined  representation  may  contain  any  of  the  2^-1  = 31  possible 
nonnull  subsets  of  these  f^ve  set  relations:  If  a subject  has  a logically 
consistent  (but  not  necessarily  correct)  representation,  it  will  be  one  of  these 
31  possible  one3.  Obviously,  some  combined  representations  are  much  more  plausi- 
ble than  others.  In  the  present  example,  three  plausible  combined  representations 
might  be  "A  equivalent  to  £, ” "A  subset  of  £,"  and  "A  equivalent  to  £ or  A subset 
of  £• " (The  last  representation  is  logically  complete  and  correct.)  Is  there 
some  way  of  finding  out  which  of  these  (or  other)  representations  subjects  ac- 
tually used?  lur  proposed  way  is  through  the  modified  truth-table  analysis. 

Consider  an  example  of  such  an  analysis  presented  in  Table  6.  Each  of 

Insert  Table  6 about  here 
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the  10  conclusions  for  the  full-set  group  is  shown  at  the  left,  and  three 
columns  are  shown  at  the  right,  one  for  each  of  the  three  plausible  represen- 
tations. It  turns  out  that  each  of  these  three  representations  predicts  a 
different  pattern  of  "definitely  true,"  "possibly  true,"  and  "never  true" 
responses  to  the  10  conclusions  given  to  the  full-set  group,  and  in  fact, 
each  of  the  31  possible  representations  predicts  a different  pattern.  The 
10  conclusions  were  chosen  so  as  to  yield  a unique  pattern  of  responses  that 
would  permit  us  to  distinguish  the  best  (most  frequently  used)  of  the  31 
possible  representations. 

Suppose  that  a subject  believes  the  correct  combined  representation  to 
be  that  of  "A  equivalent  to  £. " (He  or  she  does  not  realize  that  "A  subset  of 
£"  is  also  possible. ) Then  the  conclusion  "All  A are  £"  is  definitely  true  of 
this  representation.  This  conclusion  is  also  definitely  true  of  each  of  the 
other  three  plausible  representations  of  the  combined  set  relation,  so  that 
the  conclusion  is  not  helpful  in  distinguishing  which  representation  the  sub- 
ject actually  uses.  In  fact,  all  four  of  the  standard  syllogistic  conclusions 
(All  A are  £,  No  A are  £,  Gone  A are  £,  Some  A are  not  £)  given  to  the  subset 
group  yield  the  3?_me  pattern  of  predictions,  so  that  one  could  not  distinguish 
among  the  three  plausible  representations  on  the  basis  of  responses  to  these 
conclusions:  "All  A are  C"  is  definitely  true  of  all  of  them;  "No  A art  C"  is 
never  true  of  any  of  then;  "Gome  A are  £"  is  definitely  true  of  all  of  them; 
and  "Gome  A are  not  C"  is  never  true  of  any  of  them.  Consider,  however, 
the  conclusion,  "All  £ are  A."  This  conclusion  leads  to  a different  response 
for  each  of  the  plausible  representations.  It  is  definitely  true  of  "A  equivalent 
to  £,"  It  is  never  true,  however,  of  "A  subset  of  £,"  since  in  this  conclusion, 
some  but  not  all  of  the  £ ore  A.  It  is  possibly  true  of  the  third  representation, 
in  that  it  is  true  of  one-half  of  the  representation  (A  equivalent  to  C)  but 
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not  of  the  other  half  subset  of  £). 

We  computed  the  root-mean-snuare  deviation  between  the  pattern  of  re- 
sponses to  the  10  conclusions  predicted  by  each  of  the  31  logically  con- 
sistent representations  (again  using  l=definitely  true,  2=possibly  true, 
3=never  true)  and  both  (a)  the  observed  pattern  of  responses,  and  (b)  the 
pattern  of  responses  predicted  by  each  of  the  models  of  syllogistic  reason- 
ing. Parameter  estimates  from  the  present  experiment  were  used  in  making 
these  predictions.  It  was  possible  through  these  computations  to  infer 
for  the  full-set  group  both  the  representations  they  actually  used  for 
each  syllogism  and  the  representation  they  were  alleged  uO  use  under  the 
assumptions  of  each  model.  It  seems  likely  that  there  would  have  been  in- 
dividual differences  in  the  representations  used  by  different  subjects,  and 
even  in  the  representations  used  by  single  subjects  at  different  times. 

We  examined  only  mean  data,  however,  because  of  its  greater  stability  and 
because  the  extremely  high  internal-consistency  reliability  of  the  group 
data  for  split  halves  of  subjects  suggested  that  individual  differences  •/ere 
not  a major  influence  upon  the  results. 

Table  7 shows  the  representation  with  the  lowest  RM5D  for  each  of  the 
encoding  and  conDination  problems.  The  models  differed  in  the  accuracies 

Insert  Table  J about  here 

with  which  they  predicted  these  representations.  The  average  rank  order 
of  the  best-fitting  representation  (as  determined  by  the  actual  data) 
among  the  31  possible  representations  was  computed  for  each  model  for  both 
the  encoding  and  combination  tasks.  A rank  of  1,  for  example,  would  indicat 
that  the  best  representation  according  to  the  model  is  also  the  one  .he  data 


Components  of  Reasoning 


indicate  subjects  use.  A rank  of  2 would  indicate  that  the  actual  represen- 
tation was  ranked  only  second  best  by  the  model . The  average  rank  orders  for 
the  respective  tasks  were  1.00  and  1.^0  for  the  ideal-subject  model,  1.00 
and  l.Uo  for  the  transitive-chain  model,  1.25  and  2.^7  for  the  complete- 
combination  model,  1.25  and  2.60  for  the  random-combination  model,  1.50 
and  2.20  for  the  conversion  model,  and  5.00  and  6. 1*7  for  the  atmosphere  model. 

These  results  indicate  that  the  ideal-subject  and  transitive-chain  models 
were  better  than  the  alternative  models  in  predicting  the  best-fitting  represen- 
tation, although  these  two  models  were  not  distinguishable,  possibly  because 
subjects  receiving  the  full  set  of  conclusions  had  more  of  an  opportunity  to  reflect 
upon  possible  representations  for  the  combined  premises,  and  thus  may  have 
more  closely  resembled  ideal  subjects. 
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Discussion 

The  present  experiment  investigated  subjects'  performance  in  two 
tasks,  one  requiring  only  encoding  of  the  premises  of  syllogisms,  the  other 
requiring  both  encoding  and  combination  of  premises.  A major  finding  was 
that  subjects  made  only  about  10%  errors  in  the  encoding  task.  Since  this 
figure  includes  constant  sources  of  error  as  well  as  error  due  specifically 
to  encoding,  the  result  suggests  that  errors  in  encoding  are  not  a major 
source  of  difficulty  in  syllogistic  reasoning.  Modeling  of  the  encoding  data 
in  the  present  experiment  also  supports  the  notion  that  encoding  is  not  a 
major  source  of  response-choice  variation  in  the  syllogisms  task.  The 
ideal-subject  and  transitive-chain  models,  which  predict  error-free  encoding, 
performed  better  than  the  alternative  models  that  were  considered.  The 
differentiation  among  models  showed  up  clearly  only  when  the  Guyote-Sternberg 
parameter  estimates  were  used,  presumably  in  part  because  of  the  greater 
constraints  in  these  earlier  data.  To  summarize,  then,  the  present  data 
suggest  that  encoding  is  nearly  error-free,  and  that  most  errors  occur  in 
combination  (and  in  the  response-selection  task,  in  comparison  as  well). 

This  conclusion  is  opposed  to  that  of  certain  other  authors,  most 
notably  Ceraso  and  Provitera  (1971).  Like  us,  Ceraso  and  Provitera  claimed 
to  test  the  encoding  hypothesis  directly.  But  whereas  we  sought  to  test 
the  encoding  hypothesis  by  decomposing  the  syllogistic  evaluation  task  and 
by  manipulating  the  conclusions  given  to  the  full-set  group,  Ceraso  and 
Provitera  sought  to  test  the  hypothesis  by  manipulating  the  nature  of  the 
premises  given  to  a "modified"  syllogisns  group.  The  authors  nroposed  that 
most  errors  in  syllogisms  tasks  are  attributable  to  encoding  failure  rather 
than  to  reasoning  errors.  They  had  subjects  solve  either  traditional 
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syllogisms  (using  a variant  of  the  response-selection  task)  or  modified 
syllogisms.  The  latter  syllogisms  were  "modified"  in  the  sense  that  sub- 
jects were  taught  to  represent  each  premise  by  just  one  set  relation. 

Previous  investigations  by  Ceraso  (Note  2)  had  suggested  that  many  people 
in  fact  represent  the  premises  of  traditional  syllogisms  only  by  single 
set  relations. 

Ceraso  and  Provitera  interpreted  their  results  as  supporting  the  en- 
coding hypothesis.  This  interpretation  bears  further  examination.  It  ap- 
pears that  the  performance  of  subjects  in  the  modified  group  serves  only  as 
a poor  model  for  the  performance  of  subjects  in  the  traditional  group:  The 
squared  correlation  between  response-choice  probabilities  of  subjects  in  the 
two  groups  was  only  .69.  One  is  not  safe  in  concluding,  therefore,  that 
subjects  "responded  to  the  traditional  syllogisms  as  if  they  were  the  modi- 
fied syllogisms,  which  accounted  for  their  errors"  (Ceraso  & Provitera,  1971 » 
p.  Uoo). 

How  well  would  the  transitive-chain  model  fit  Ceraso  and  Provitera 's 
data.r’  Of  the  13  syllogisms  used  by  Ceraso  and  Provitera,  11  were  used  in 
Experiment  1 of  Guyote  and  Sternberg  (Note  l).  We  used  the  parameter  esti- 
mates from  Guyote  and  Sternberg  (Note  1)  for  the  ll  common  syllogisns  to  pre- 
dict Ceraso  and  Provitera' s data  for  the  traditional-syllogisms  groun.  Since 
Ceraso  and  Provitera  did  not  have  as  a conclusion  "Some  A are  not  £,"  we  combined 
this  conclusion  with  "Some  A are  C, " which  could  be  expected  slightly  to  reduce 
the  fit  of  the  transitive-chain  model.  Recall  also  that  Ceraso  and  Provitera 
did  not  teach  subjects  to  use  the  logical  meaning  of  some,  whereas  Guyote  and 
Sternberg  did.  This  difference  could  also  be  expected  to  reduce  the  fit. 
Nevertheless,  the  squared  correlation  between  the  predictions  of  the  transitive- 


chain  model  for  Guyote  and  Sternberg'3  data  and  Ceraso  and  Provitera's  traditional- 
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group  data  was  .91.  The  squared  correlation  between  the  modified-group 
and  traditional-group  data  for  these  same  11  syllogisms  was  .69  (the  same 
as  for  the  full  set  of  13  syllogisms).  Thus,  the  predictions  of  the  transi- 
tive-chain model  for  different  subjects  in  a different  experiment  adminis- 
tered with  different  instructions  were  considerably  better  than  predictions 
based  upon  the  modified-group  data. 

Ceraso  and  Provitera,  recognizing  that  the  traditional -group  data  were 
not  well  predicted  by  the  modified-group  data,  suggested  two  other  sources 
of  error  in  syllogistic  reasoning.  The  first  was  atmosphere  of  the  premises. 
The  second  was  incomplete  combination  of  set  relations.  Both  of  these 
sources  of  error  are  predicted  by  the  transitive-chain  model  of  syllogistic 
reasoning,  according  to  which  working  memory  limitations  result  in  incomplete 
information  processing  during  the  combination  stage,  and  atmosphere  of  the 
premises  serves  as  one  heuristic  for  choosing  a preferred  conclusion  during 
the  comparison  stage  (see  Guyote  & Sternberg,  Note  l). 

According  to  Ceraso  and  Provitera,  a subset  of  five  of  the  syllogisms 
they  used  provided  a particularly  important  test  of  their  model,  in  that 
neither  of  the  two  additional  sources  of  error  noted  above  should  have  opera- 
ted differentially  for  the  traditional-  and  modified-syllogisms  groups.  For 
these  five  syllogisms,  the  squared  correlation  between  response-choice 
patterns  in  the  two  groups  was  .92.  Three  of  these  syllogisms  were  also  used 
by  Guyote  and  Sternberg  (Note  l).  The  squared  correlation  between  the  predic- 
tions of  the  transitive-chain  model  for  the  Guyote-Sternberp'  data  and  Ceraso 
and  Provitera.’.s  response-choice  data  was  .96.  The  comparable  squared  correla- 
tion for  these  three  syllogisms,  using  the  modified-group  data  to  predict 
the  traditional  group  data,  was  also  .96. 


We  conclude  from  these  analyses  that  the  transitive-chain  model  provider; 
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as  good  an  account  as  Ceraso  and  Provitera's  encoding  hypothesis  for  the 
small  subset  of  the  data  considered  immediately  above,  and  provides  a much 
better  account  of  the  data  as  a whole  than  does  the  encoding  hypothesis 
taken  alone.  Taken  as  a whole,  therefore,  the  results  of  the  present  experi- 
ment as  well  as  Ceraso  and  Provitera's  are  consistent  with  the  notion  that 
encoding  errors  are  not  a major  source  of  response-choice  variation  in 
syllogistic  reasoning.  If  subjects  are  not  instructed  to  use  the  logical 
meaning  of  some,  it  seems  likely  that  response-choice  variation  due  to  the 
encoding  stage  may  increase.  But  this  increased  variation,  to  whatever 
extent  it  may  exist  in  Ceraso  and  Provitera's  and  other  experiments,  seems 
likely  to  be  due  not  to  an  (interesting)  inability  to  translate  verbal 
descriptions  into  set  relations,  but  to  a (less  interesting)  ignorance  on 
the  part  of  subjects  of  what  a particular  verbal  description  is  supposed  to 
mean  in  sentential  logic. 

Modeling  of  the  combination  data  also  supported  the  transitive-chain 
model,  at  least  for  the  standard  subset  of  conclusions.  The  data  from 
the  subset  of  conclusions  seem  to  rule  out  the  ideal -subject , conversion, 
and  atmosphere  models  relative  to  the  transitive-chain  model.  The  complete- 
and  random-combination  models  are  ruled  out  only  if  the  Guyote-Stemberg 
parameter  estimates  are  used.  The  transitive-chain  model,  however,  showed 
more  stable  parameter  estimates  across  subjects  and  tasks  than  did  either 
the  complete-  or  random-combination  models. 

Using  a modified  truth-table  analysis,  we  were  able  to  infer  the  repre- 
sentations used  by  full-set  subjects  in  encoding  and  combining  premises.  It 
was  noted,  however,  that  the  additional  conclusions  given  to  full-set  subjects 
in  order  to  enable  us  to  infer  their  representations  may  have  resulted  in  their 
becoming  aware  of  additional  possible  set  relations. 

Although  the  results  of  this  experiment  and  the  previous  ones  (Guyote  A 
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Sternberg,  Note  l)  generally  support  the  transitive-chain  model,  there  is 
systematic  response-choice  variation  left  unaccounted  for  in  all  of  the 
experiments  conducted  so  far.  Neither  the  transitive-chain  model,  nor  any 
other  model  considered,  for  that  matter,  is  true.  The  transitive-chain  model 
is  no  doubt  a simplification  of  the  model  subjects  actually  use.  First, 
it  does  not  fully  account  for  performance  during  encoding.  The  evidence 
suggests  that  although  encoding  is  not  nearly  so  flawed  as  some  investigators 
have  believed,  neither  is  it  flawless , as  assumed  by  the  transitive-chain 
model.  Second,  the  model  does  not  fully  account  for  the  effects  of  syllo- 
gistic figure  (the  order  in  which  the  terms  appear  in  each  of  the  premises) 
upon  response  choice.  Previous  evidence  (for  example,  Dickstein,  1978)  sug- 
gests that  figure  has  fairly  complex  effects  unon  subjects'  choices  of  re- 
sponses. A more  sophisticated  model,  therefore,  might  take  into  account  all 
of  these  complex  effects.  Attempts  at  the  formulation  of  such  a model  are 


presently  being  made 
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Appendix 

Quantification  of  Models  of  Syllogistic  Reasoning 
In  this  appendix,  the  quantification  of  the  models  of  syllogistic  reasoning 
vill  be  described  in  some  detail.  Each  description  vill  be  based  upon  an  ex- 
ample from  the  combination  task.  All  £ are  B..  Some  A are  not  B. 

Ideal  Model 

Encoding.  According  to  this  model,  each  premise  is  encoded  completely  and 
correctly.  Thus,  the  first  premise  of  the  example  syllogism  is  represented  as 

la.  C equivalent  to  B. 

lb.  £ subset  of  B. 

and  the  second  premise  is  represented  as 
2a.  A overlapping  with  B. 

2b.  A disjoint  vith  _B. 

2c.  A superset  of  B. 

Combination.  The  model  also  assumes  the  the  premises  are  combined  completely 
and  correctly.  Each  of  the  tvo  set  relations  from  the  first  premise  can  be  com- 
bined vith  each  of  the  three  set  relations  from  the  second  premise,  yielding  six 
combinations. 

la  2a.  A overlapping  vith  £. 
la  & 2b.  A disjoint  vith  £. 
la  & 2c.  A superset  of  £. 

lb  & 2a.  A overlapping  vith  C or  A disjoint  vith  C or  A superset  of  C_. 
lb  & 2b.  A disjoint  vith  C. 
lb  & 2c.  A superset  of  £. 

The  representations  resulting  from  the  various  combinations  are  obviously  non- 
unique. In  fact,  there  are  exactly  three  distinct  set'  relations  between  A and  f 


A overlapping  vith  C,  A disjoint  vith  £,  A superset  of  £. 
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Response . Responses  to  the  various  conclusions  are  chosen  on  the  basis  of 
this  combined  representation.  Consider  as  examples  the  four  conclusions  from 
the  standard  subset.  All  are  C_"  fails  to  describe  any  of  the  three  composite 
set  relations,  and  so  is  never  true.  "No  A are  C"  correctly  describes  one  of 
the  three  composite  set  relations  (A  disjoint  with  C),  and  so  is  possibly  true. 
"Some  A are  C"  correctly  describes  two  of  the  three  relations  (A  overlapping 
with  £,  A superset  of  £),  and  so  is  also  possibly  true.  "Some  A are  not  £" 
correctly  describes  all  three  of  the  relations,  and  so  is  definitely  true.  In 
quantifying  predicted  and  observed  values,  a response  of  "definitely  true"  was 
assigned  a numerical  value  of  1;  a response  of  "possibly  true"  was  assigned  a 
numerical  value  of  2;  a response  of  "never  true"  was  assigned  a numerical  value 
of  3.  4 

Transitive-Chain  Model 

Encoding.  The  transitive  chain  model,  like  the  ideal  model,  assumes  com- 
plete and  correct  encoding,  so  that  the  discussion  of  encoding  for  the  ideal 
model  is  relevant  here  as  well. 

Combination.  Encodings  la,  2a,  and  2b  are  symmetrical,  and  hence  preferred. 
As  a result,  there  are  two  preferred  pairs  of  set  relations — la  & 2a,  la  & 2b. 
These  will  always  be  combined  first.  Let 
p^  = P(performing  one  combination) 

Pg  = P(perforraing  tvo  combinations) 
p^  = P(performing  three  combinations) 
p^  = P(performirg  four  combinations) 

If  Just  one  combination  is  performed,  it  will  be  with  one  of  the  two  pre- 
ferred pairs  of  set  relations: 

la  & 2a.  A overlapping  with  C. 


or 
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la  & 2b.  A disjoint  with  £. 

Since  each  of  the  two  possible  combinations  is  equally  likely, 

P(A  overlapping  with  £)  = l/2(p^) 

P(A  disjoint  with  £)  = l/2(p1) 

If  two  combinations  are  perforned,  they  will  be  the  two  preferred  ones,  so  that 
P(A  disjoint  with  £,  A overlapping  with  £)  = p2 
Suppose  that  three  combinations  are  performed.  They  will  be  some  subset  of 
the  six  possible  combinations  of  la  and  lb  with  2a,  2b,  and  2c.  Six  combinations 
taken  three  at  a time  results  in  20  possible  subsets.  But  the  two  preferred  pairs 
must  be  included,  so  that  only  one  of  the  four  combinations  is  left  to  vary. 

Pour  things  taken  one  at  a time  results  in  four  possible  subsets.  Listing  of 
the  elements  in  these  subsets  reveals  that 

P(A  disjoint  with  £,  A overlapping  with  £,  £ superset  of  £)  = 3A(p^) 

P(A  disjoint  with  £,  A overlapping  with  £)  = 1/U (p^) 

Suppose,  finally,  that  four  combinations  are  performed.  The  same  logic  applies 
as  was  used  above  in  determining  possible  combinations.  There  end  up  being  six 
possible  subsets.  Each  of  the  subsets  yields  the  same  three  composite  represen- 
tations, so  that 

P(A  disjoint  with  £,  A overlapping  with  £,  A superset  of  £)  = p^ 

To  summarize,  subjects  might  use  any  of  four  possible  representations,  depending 
upon  how  many  and  which  combinations  they  performed.  The  four  possible  represen- 
tations and  their  associated  probabilities  are 
P(A  disjoint  with  £)  = l/2(p^) 

P(A  overlapping  with  £)  = l/2(p^) 

P(A  disjoint  with  £,  A overlapping  with  C)  = p?  + l/Mp^) 

P(A  disjoint  with  £,  A overlapping  with  £,  A superset  of  £)  = 3A(p^)  + p^ 
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Response.  The  - tinted  response  for  each  conclusion  can  now  be  computed. 
Let  us  consider  the  prediction  for  the  conclusion,  "Some  A are  £. " If  the 
subject  uses  the  first  representation  (A  disjoint  with  £),  the  conclusion  is 
never  true.  If  the  subject  uses  the  second  representation  (A  overlapping  with 
£),  the  conclusion  is  definitely  true.  Finally,  if  the  subject  uses  the  third 
or  fourth  representation  (A  disjoint  with  £,  A overlapping  with  C or  A disjoint 
with  £,  A overlapping  with  £,  A superset  of  £) , the  conclusion  is  possibly  true, 
since  at  least  one,  but  not  all  of  the  set  relations  in  each  representation  can 
be  appropriately  described  by  this  conclusion.  We  are  now  able  to  state  that 
P( "definitely  true")  = 1/2(P;l) 

P( "possibly  true")  = P2  + P3  + Pi* 

P("never  true")  = 1/2^) 

It  is  now  possible  to  compute  the  predicted  value  for  the  conclusion,  keeping  in 
mind  that  "definitely  true"  is  assigned  a numerical  value  of  1,  "possibly  true"  a 
numerical  value  of  2,  and  "never  true"  a numerical  value  of  3.  The  predicted 
value  is  simply  the  expected  value  of  the  possible  responses: 

Predicted  value  = l[l/2(p1)]  + 2[p2  + P3  + PjJ  + 3[l/2(p1)] 

= 2(px  + P2  + P3  + p^)  = 2 

In  this  particular  example,  it  is  not  necessary  even  to  estimate  the  individual 
values  of  p^,  P2,  p^,  and  Pj^,  since  they  must  sun  to  1.  In  other  examples,  how- 
ever, the  values  of  the  probabilities  must  be  estimated  in  order  to  predict  the 
response  to  a given  conclusion. 

Random -Combination  Model 

Encoding.  According  to  Erickson  (197M , a given  set  relation  may  have  a 
predefined  probability  of  zero  or  one  of  being  used  to  encode  a given  premise, 
or  it  may  have  an  estimated  probability  of  being  used.  Consider  the  pre-  '.se 
"All  A are  B."  Two  parameters,  which  we  shall  label  p^  and  t>2,  are  associated 
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vith  encoding  of  this  premise: 

p^  = P( encoding  "All  A are  B"  as  A equivalent  to  B) 
p^  = P(encoding  "All  A are  B"  as  A subset  of  B) 

All  other  encodings  are  used  with  probability  zero.  The  premise  "No  A are  B" 
is  encoded  as  A disjoint  with  B^ with  probability  one.  In  Erickson's  formulation 
of  the  random-combination  model,  two  parameters,  which  we  shall  call  p^  and  p^, 
axe  associated  with  the  encoding  of  the  premise  "Some  A are  B": 
p^  = P(encoding  "Some  A are  B"  as  A superset  of  B) 

p^  = P(encoding  "Some  A are  B_"  as  A overlapping  with  B) 

All  other  encodings  are  used  with  probability  zero.  Erickson  apparently  did  not 
inform  his  subjects  of  the  logical  meaning  of  "some,"  as  is  commonly  done  in 
experiments  on  syllogistic  reasoning.  We  did  inform  subjects  of  this  meaning, 
however,  rendering  it  implausible  that  subjects  would  never  use  the  other  two 
set  relations  that  properly  describe  the  premise  "Some  A are  B. " We  therefore 
estimated  two  additional  parameters  for  our  data: 

p,.  = P(encoding  "Some  A are  B"  as  A equivalent  to  B) 

Pg  = P(encoding  "Some  A are  B"  as  A subset  of  B_) 

The  premise  "Some  A are  not  B"  is  proposed  by  Erickson  to  be  encoded  as  A over- 
lapping with  B with  probability  one.  In  order  to  maximize  the  fit  of  the  model 
to  the  data,  three  additional  parameters  were  also  estimated,  corresponding  to 
probabilities  of  encoding  "Some  A are  not  B"  in  various  ways: 
p^  = P( encoding  "Some  A are  not  3"  as  A superset  of  B) 

Pg  = P( encoding  "Some  A are  not  3"  as  A overlapping  with  B) 

Pp  = P(cncoding  "Some  A are  not  B"  as  A disjoint  with  B) 

Note  that  the  model  with  parameters  as  estimated  is  an  augmented  version  of 

Erickson's  original  model.  Its  performance  will  always  be  equal  to  or  bette- 
than  that  of  the  original  model,  since  fixed  constants  in  the  original  model  are 


y 


\ 
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now  estimated  as  ./ee  parameters.  The  augmented  version  of  the  model  retains 
Erickson's  basic  _nformation-processing  framework  while  permitting  more 
flexibility  in  thj  encoding  process. 

Turning  now  to  our  example  syllogism,  we  see  that  the  first  premise  of 
the  example  problem  can  be  encoded  in  either  of  two  ways  (la  or  lb),  and  the 
second  premise  can  be  encoded  in  any  of  three  ways  (2a,  2b,  2c).  The  probability 
that  the  first  premise  of  the  example  will  be  encoded  as  C.  equivalent  to  B 
is  p^;  the  probability  that  it  will  be  encoded  as  C_  subset  of  B is  p^.  The 
probability  that  the  second  premise  of  the  example  will  be  encoded  as  X 
superset  of  13  is  p^;  the  probability  that  it  will  be  encoded  as  A overlapping 
with  B is  Pg;  and  the  probability  that  it  will  be  encoded  as  A disjoint  with 
B is  Pp.  Note  that  in  Erickson's  model,  subjects  always  encode  each  premise 
with  only  one  set  relation. 

Combination.  Since  the  first  premise  can  be  represented  in  two  ways  and 
the  second  premise  in  three  ways,  there  are  six  possible  ways  of  combining  infor- 
mation from  the  two  premises,  which  are  the  same  as  the  six  shown  for  the  ideal 
model  (see  Combination  section).  The  probabilities  of  the  various  combinations 
are  la  with  2a,  la  with  2b,  p^g-,  la  with  2c,  p^^;  lb  with  2a,  PgPjJ 

lb  with  2b,  P^Pg*  ^ with  2c,  p,^.  in  all  of  these  possible  combinations 
except  lb  with  2a,  there  is  only  one  possible  resulting  set  relation,  so  that 
the  probability  of  using  each  set  relation  is  equal  to  the  probabilities  of 
the  various  combinations.  If  lb  is  combined  with  2a,  however,  there  are  three 
possible  resulting  set  relations.  In  the  random  combination  model,  only  one 
of  the  three  combined  set  relations  is  constructed,  however,  and  this  one  is 
chosen  at  random.  Thu3,  the  probability  of  any  one  of  the  three  set  relations 
being  constructed  is  l/3(p0p^). 

We  can  now  compute  the  probability  that  any  given  set  relation  will  result 
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from  the  combination  process: 

P(A  overlapping  vith  C)  = p^  + l/SCp^) 

P(A  disjoint  vith  £)  = p^Pg  + p^g  + l/3(p2P^) 

P(A  superset  of  C)  = p^  + l/SCp^)  + p ^ 

Response . The  predicted  value  of  the  subject's  typical  response  for  each 
conclusion  is  computed  in  the  same  vay  as  for  the  transitive-chain  model.  The 
predicted  response  is  an  expected  value  computed  by  summing  the  products  of 
the  numerical  value  of  each  possible  response  (definitely  true,  possibly  true, 
never  true)  times  the  probability  that  each  response  is  given.  Consider  the 
conclusion  "Some  A are  £."  The  response  "definitely  true"  will  be  given  if 
either  the  set  relation  A overlapping  vith  £ or  the  set  relation  A superset 
of  £ is  used.  The  response  "possibly  true"  vill  never  be  given  in  this  or 
any  other  example,  since  only  one  set  relation  is  ever  used  in  the  combined 
representation.  The  response  "never  true"  vill  be  given  if  the  set  relation 
A disjoint  vith  £ is  used.  The  associated  probabilities  are 
P( "definitely  true")  = p^  + 2/3 (p^)  + P^  + p^ 

P( "possibly  true")  = 0 

P( "never  true")  = p^pg  + PgPg  + l/^fp^) 

The  predicted  value  of  the  response  to  the  conclusion  "Some  A are  £"  is  thus 

Predicted  value  = ltp^  + 2/3(PgP7)  + p^  + p^]  + 2[0]  + 3[p.jP8  + P2Pg  + 
1/3(P2P7)) 

Complete  Combination  Model 

Encoding.  This  model  makes  the  same  assumptions  about  encoding  as  does  the 
random  combination  model. 

Combination.  This  model  differs  from  the  random  combination  model  in  one 
fundamental  respect.  If  a conbined  representation  comprises  more  than  one  set 
relation,  all  set  relations  are  constructed  rather  than  one  being  chosen  s ran- 
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dom.  In  the  example  problem,  therefore, 

P(A  overlapping  with  C)  = 

P(A  disjoint  vith  £)  = p1Pg  + p2pQ  + p2p^ 

P(A  superset  of  C)  = p^Pg  + PgP  + p2?g 

Response.  The  predicted  response  is  computed  in  the  same  way  as  above.  For 
the  conclusion  "Some  A are  £,"  the  probabilities  associated  with  each  possible 
response  are 

P( "definitely  true")  = p^  + p^g  + p^ 

P{"possibly  true")  = p^p^ 

P( "never  true")  = p^Pg  + p^g 

The  predicted  value  of  the  response  to  the  conclusion  "Some  A are  £"  is 
Predicted  value  + Kp^  + P1Pg  + P^)  + 2(Pgp  ) + KVjVq  + P2Pg) 

Conversion  Model 

Encoding.  It  is  assumed  that  eachpremise  is  represented  as  the  conjunction 
of  the  premise  in  its  stated  form  and  the  premise  in  its  converted  form.  Thus, 
the  premise  "All  £ are  £"  in  the  example  is  interpreted  as  meaning  that  "All  £ 
are  £ and  all  £ are  £, " and  is  represented  as  £ equivalent  to  £.  The  premise 
"Some  £ are  not  £"  is  interpreted  as  meaning  that  "Some  A are  not  £ and  some 
£ are  not  A,"  and  is  represented  as  A overlapping  with  £,  A disjoint  vith  £. 

Note  that  under  this  interpretation  of  the  conversion  model,  representations 
of  both  premises  are  different  from  those  obtained  under  the  ideal  model. 

Combination.  Combination  proceeds  as  in  the  ideal  model,  except  that  com- 
bination is  performed  upon  the  representations  as  encoded  by  the  rule  stated 
above.  Combining  the  one  set  relation  from  encoding  of  the  first  premise  vith 
the  tvo  set  relations  from  encoding  of  the  second  prenise  yields  the  combined 
representation  A overlapping  vith  £,  A disjoint  vith  C. 


rectly  to  the  conclusions,  which  are  interpreted  in  unconverted  form.  Consider 
the  conclusion  "Some  A are  £."  The  predicted  value  for  the  response  is  2 ("pos- 
sibly true"),  since  one  but  not  the  other  combined  set  relation  is  properly  de- 
scribed by  this  conclusion. 

Atmosphere  Model 

Encoding.  According  to  this  model,  subjects  are  assumed  to  encode  two  fea- 
tures of  each  premise,  the  quantification  (universal  or  particular)  and  the 
polarity  (affirmative  or  negative).  In  the  example  syllogism,  the  first  premise 
is  universal  affirmative  and  the  second  premise  is  particular  negative. 

Combination.  Combination  proceeds  according  to  two  rules.  First,  if 
either  or  both  premises  is  particular,  then  the  atmosphere  of  the  syllogism 
is  particular;  otherwise,  the  atmosphere  is  universal.  Second,  if  either  or 
both  premises  is  negative,  then  the  atmosphere  of  the  syllogism  is  negative; 
otherwise,  the  atmosphere  is  affirmative.  In  the  example  syllogism,  the  second 
premise  is  both  particular  and  negative;  hence,  the  atmosphere  is  particular 
negative . 

Response.  The  subject  evaluates  the  atmosphere  of  each  conclusion.  If 
the  atmosphere  of  the  combined  premises  matches  the  atmosphere  of  the  conclusion 
in  both  quantification  and  polarity,  the  subject  responds  "definitely  true."  If 
the  atmosphere  of  the  combined  premises  matches  the  atmosphere  of  the  conclusion 
in  either  quantification  or  polarity,  but  not  both,  the  subject  responds 
"possibly  true."  If  the  atmosphere  of  the  combined  premises  matches  the  atmos- 
phere of  the  conclusion  in  neither  quantification  nor  polarity,  the  subject 
responds  "never  true."  Consider,  for  example,  the  conclusion  "Some  A are  £. " 
This  conclusion  is  particular  affirmative.  Since  its  atmosphere  matches  the 
atmosphere  of  the  combined  premioe3  in  quantification  but  not  polarity,  the 
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subject  is  predicted  to  responds  "possibly  true,"  and  the  predicted  value  for 
this  conclusion  is  therefore  2. 
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Footnotes 


Portions  of  this  article  were  presented  at  the  Mathematical  Psychology 
Meetings  in  Los  Angeles,  August,  1977.  We  are  grateful  to  Martin  Guyote 
for  valuable  suggestions  and  comments.  Preparation  of  this  article  was 
supported  by  ONR  Contract  N0001^78C0025  to  Robert  Sternberg;  the  research 
reported  in  the  article  was  supported  by  NSF  Grant  BNS-76-05311  to  Robert 
Sternberg.  Requests  for  reprints  should  be  sent  to  Robert  J.  Sternberg, 
Department  of  Psychology,  Box  11A  Yale  Station,  Yale  University,  New  Haven, 
Connecticut  06520. 

^These  six  models  exhausted  the  plausible  alternatives  that  we  could  find 
that  were  suitable  for  our  purposes.  Since  preparation  of  the  article, 
Johnson-Laird  and  Steedman  (i.978)  have  proposed  a new  model  of  syllogistic 
reasoning.  The  model  of  Ceraso  and  Provitera  (1971)  was  considered,  but  was 
deemed  unsuitable  because  of  its  fundamental  assumption  that  subjects  do  not 
know  the  logical  meaning  of  some.  We,  like  most  investigators,  instructed 
subjects  as  to  this  meaning.  We  attempted  to  quantify  the  model  of  Revlis 
(1975)»  but  found  the  level  of  detail  presented  in  Revlis's  (1975)  article 
insufficient  for  quantification. 

^Root -mean- square  deviations  were  also  computed  for  individual  syllogisms. 
The  ideal-subject  model  had  the  lowest  R!!SD  or  was  tied  for  the  lowest  on  3 of 
the  15  syllogisms,  the  transitive-chain  model  on  12,  the  complete-combination 
model  on  5,  the  random-combination  model  on  6,  the  conversion  model  on  3,  and 
the  atmosphere  model  on  3.  These  numbers  sum  to  greater  than  15  because  of 
the  large  number  of  ties. 

3 

This  observation  was  made  by  !tartin  Guyote. 
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Three  alternative  procedures  were  used  for  prediction  of  response  choices. 

In  the  first,  the  expected  value  of  the  response  for  each  conclusion  (on  the 
1 to  3 scale)  was  used  as  the  predictor,  and  the  observed  response  to  each 
conclusion  (also  on  the  1 to  3 scale)  was  used  as  the  criterion.  In  the 
second  procedure,  computations  were  identical,  except  that  conclusions  desig- 
nated as  logically  valid  (definitely  true)  were  assigned  a value  of  1,  and 
conclusions  designated  as  logically  invalid  (possibly  true  or  never  true) 
were  assigned  a value  of  2.  In  the  third  procedure,  the  expected  proportions 
of  definitely  trues , possibly  trues , and  never  trues  for  each  conclusion  were 
used  as  predictors  (resulting  in  three  times  as  many  data  points  as  for  the 
first  procedure),  and  the  observed  proportions  were  used  as  criteria.  The 
three  methods  gave  very  similar  results,  and  so  only  results  from  the  first 
method  are  reported  here.  Prediction  equations  for  the  method  may  be  ob- 
tained bv  writing  to  the  senior  author. 
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notes:  Idetil -subject  and  transitive-chain  models  make  the  same  predictions  for  the  encoding  task.  Parameters 


Performance  of  Alternative  Models  on  Encoding  Task  for  Individual  Subjects: 

Subset  Group 
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Note.  Complete  and  random  combination  models  make  indistinguishable  predictions  for  the  encoding  task. 
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Table  5 Continued 

= P(Performing  exactly  one  combination) 

P2  = P(Performing  exactly  two  combinations) 

Pg  = P(Performing  exactly  three  combinations) 

P^  = P(Performing  exactly  four  combinations) 

Pj_  = P(encoding  "All  A are  B"  as  A equivalent  to  B) 

Pg  = P(encoding  "All  A are  B"  as  A subset  of  B_) 

Pg  = P( encoding  "Some  A are  B"  as  A superset  of  B) 

= P(encoding  "Some  A are  B"  as  A overlapping  with  B) 

P^  = P (encoding  "Some  A are  B"  as  A equivalent  to  13) 

Pg  = P(encoding  "Some  A are  B"  as  A subset  of  B) 

Py  = P(encoding  "Some  A are  not  B"  as  A superset  of  B) 

Pg  = P( encoding  "Some  A are  not  3"  as  A overlapping  with  B) 

Pp  = P(encoding  "Some  A are  not  B"  as  A disjoint  with  B) 

G-S  = Guyote-Sternberg  data 
S-T  = Sternberg-Tumer  data 

Comb.  = Combination  task 
Enc.  * Encoding  task 


£ 


Table  6 
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Example  of  a Modified  Truth-Table  Analysis 


Premises:  All  B are  £. 

All  A are  B. 


Conclusion 

All  A are  C. 

No  A are  C. 

Some  A are  £. 

Some  A are  not  £. 

All  £ are  A. 

Some  £ are  not  A. 

All  A are  £ and  all  £ are  A. 

Some  but  not  all  A are  £ and 
some  but  not  all  £ are  A 
All  A are  £ and  some  £ are  not  A 
All  C ere  A and  some  A are  not  C 


Set  Relation(s)  Used  to  Represent  A-C 


A equiva- 
lent to  £ 

A subset 

of  £ 

A equiva- 
lent to  £or 

D 

D 

D 

N 

N 

N 

D 

D 

D 

N 

N 

N 

D 

N 

P 

N 

D 

P 

D 

N 

P 

N 

N 

N 

N 

D 

P 

N 

N 

N 

Relation 

: subset 
of  C 


Note:  D = Definitely,  P = Possibly,  N = Never 
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Representations  of  Encoded  and  Combined  Premises 


Encoding;  Task 


Set 

Relation 

Premise 

A,B 

coincident 

A,B  A 

overlapping 

subset 

of  B 

A superset 

of  B 

A disjoint 

vith  £ 

All  A are  B. 

X 

X 

No  A are  B. 

X 

Some  A are  B. 

X 

X 

X 

X 

Some  A are  not 

B. 

X 

X 

X 

Combination  Task 

Set 

Relation 

Premises 

A,C 

A,C  A 

subset 

A superset 

A disjoint 

coincident 

overlapping 

of  C 

of  £ 

with  £ 

All  B are  £. 
All  A are  B_. 

X 

X 

All  £ are  B. 

No  A are  B. 

X 

No  B are  £. 

All  A are  B. 

X 

All  £ are  £. 

Some  A are  not 

B. 

X 

?a 

X 

No  £ are  B. 

Some  B are  A. 

X 

X 

J 


Premises 
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5h 

Table  7 Continued 

A,£  A,£  A subset  A superset  A disjoint 

coincident  overlapping  of  C of  £ with  C 


Some  B are  not  £. 
All  B are  A. 


No  £ are  B. 
All  B are  A, 


All  C are  B. 

All  B are  A.  X ‘ ' x 

All  B are  £. 

X X XX 

All  B are  A. 

Some  B are  £. 

All  B are  A.  X X X x 

All  B are  £. 

X X X X 

Some  A are  3. 

All  B are  £. 

„ x x 

No  A are  B. 

Some  £ are  not  B. 

XX  X 

All  A are  3. 

Some  £ are  B. 

„ x x x x x 

Some  B are  not  A. 

Some  £ are  not  £. 

No  B are  A.  * x x x x 

Note:  An  x in  a given  cell  indicates  that  this  set  relation  is  part  of  the 
encoded  or  combined  representation. 

*Due  to  a tie  between  PNSDs  for  two  representations,  the  inclusion  of  this  so* 
relation  is  questionable. 
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